Automating Physical Database Design: An Extensible Approach
نویسندگان
چکیده
In a high-level query language such as SQL, queries yield the same result no matter how the logical schema is physically implemented. Nevertheless, a query's cost can vary by orders of magnitude among di erent physical implementations of the same logical schema, even with the most modern query optimizers. Therefore, designing a low-cost physical implementation is an important pragmatic problem|one that requires a sophisticated understanding of physical design options and query strategies, and that involves estimating query costs, a tedious and error-prone process when done manually. We have devised a simple framework for automating physical design in relational or post-relational DBMSs and in database programming languages. Within this framework, design options are uniformly represented as \features", and designs are represented by \con ict"-free sets of features. (Mutually exclusive features con ict. An example would be two primary indexes on the same table.) The uniform representation of design options as features accommodates a greater variety of design options than previous approaches; adding a new design option (e.g. a new index type) merely entails characterizing it as a feature with appropriate parameters. We propose an approximation algorithm, based on this framework, that nds low-cost physical designs. In an initial phase, the algorithm examines the logical schema, data statistics, and queries, and generates \useful features"|features that might reduce query costs. In a subsequent phase, the algorithm uses the DBMS's cost estimates to nd \best features"|features that belong to the lowestcost designs for each individual query. Finally, the algorithm searches among con ict-free subsets of the best features of all the queries to nd organizations with low global cost estimates. We have implemented a prototype physical design assistant for the INGRES relational DBMS, and we evaluate its designs for several benchmarks, including ASSSAP. Our experiments with the prototype show that it can produce good designs, and that the critical factor limiting their quality is the accuracy of query cost estimates. The prototype implementation isolates dependencies on INGRES, permitting our framework to produce design assistants for a wide range of relational, nested-relational, and object-oriented DBMSs. To my parents, Barbara L. Rozen and Jerome G. Rozen, Jr., and to my daughter, Kate.
منابع مشابه
Automating Physical Database Design Automating Physical Database Design: An Extensible Approach
In a high-level query language such as SQL, queries yield the same result no matter how the logical schema is physically implemented. Nevertheless, a query's cost can vary by orders of magnitude among di erent physical implementations of the same logical schema, even with the most modern query optimizers. Therefore, designing a low-cost physical implementation is an important pragmatic problem|...
متن کاملDatplex: an Extensible Design-aid Tool for Physical Database Design
In this paper, we describe DATPLEX, an extensible design-aid tool for physical database design. DAT-PLEX adopts a layered approach where each layer improves over its input design; and is \extensible" in the sense that an additional layer can be \plugged" into the tool, or an existing layer can be removed from the tool. The current implementation comprises three layers that improve the logical s...
متن کاملAutomating Semantic Blogging
Blogging is an easy and widely usable tool for data exchange and knowledge management. But the data in blogs are limited to the ‘followers’ (human readers) and not easily available for machine processing to enable information aggregation or further analysis. Thus, there is a need to enrich blog content with additional semantics using metadata. This paper proposes an approach for automating sema...
متن کاملAn Extensible Object-Oriented Approach to Databases for VLSI/CAD
This paper describes an approach to the specification and modeling of information associated with the design and evolution of VLSI components. The approach is characterized by combined structural and behavioral descriptions of a component. Database modeling requirements specific to the VLSI design domain are considered and techniques t.o address them are described. An extensible object-oriented...
متن کاملText Search in an NFS-Proxy: A Case Study in Extensible File Systems
This paper describes the design of an extensible 3-tiered semantic file system, backed by an existing extensible object-relational database. The system is designed to export the standard NFS interface, while providing indexing and query support for user-defined file types using the virtual directory abstraction. To illustrate the feasibility of the proposed architecture, we describe its impleme...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993